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(57) Abstract 

Hic present invention relates to a method for preparing positive polypeptide variants by shuffling different nucleotide sequences 
of homologous DNA sequences by in vivo rccombmation compristog the steps of a) fonning at least one circular plasmid comprising a 
DNA sequence encoding a polypcptode. b) opening said circular plasmid(s) within the DNA scqucncc(5) encoding the polypeptidc(s) c) 
prepanng at least one DNA fra^ent comprising a DNA sequence homologous to at least a part of the polypeptide coding rcrion on at leas 
Zl^l^L^T^ P f^^^'ilK'^^i!??^*^'"^ "^"^ plasmid(s). together with at least one of said homologous DNA 

fragment(s) covcrmg full-length DNA sequences encoding said polypeptide(s) or parts thereof, mto a recombination host cell, e) cultivating 
said recombination host cell, and f) screening for positive polypeptide variants, tuiuvaung 
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Title: Method for preparing polypeptide variants 
FIELD OF THE INVENTION 

5 

The present invention relates to a method for preparing polypeptide 
variants by in vivo recombination. 

10 BACKGROUND OF THE INVENTION 

The advantages of producing biologically active polypeptides by 
cloning naturally occurring DNA sequences from microorganisms, such 
as fungal organisms, and bacteria using recombinant DNA technology 
15 have been known for quite some years. 

Preparation of novel polypeptide variants and mutants, such as novel 
modified enzymes with altered characteristics, e.g. specific activ- 
ity, substrate specificity, pH-optimum, pi, Kp, Vn^x etc., have 
20 especially during the recent years diligently and successfully been 
used for obtaining polypeptides with improved properties. 

For instance, within the technical field of enzymes the washing 
and/or dishwashing performance of e.g. proteases, lipases, amylases 
25 and cellulases have been improved significantly. 

In most cases these improvements have been obtained by site-directed 
mutagenesis resulting in substitution, deletion or insertion of 
specific amino acid residues which have been chosen either on the 
30 basis of their type or on the basis of their location in the second- 
ary or tertiary structure of the mature enzyme (see for instance US 
patent no. 4,518,584). 

An alternative general approach for modifying proteins and enzymes 
35 have been based on random mutagenesis, for instance, as disclosed in 
US 4,894,331 and WO 93/01285 

As it is a cumbersome and time consuming process to obtain po- 
lypeptide variants or mutants with improved functional properties a 
4 0 few alternative methods for rapid preparation of modified 
polypeptides have been suggested. 
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Despite the existence of the above methods there are still need for 
even better iterative in vivo recombination methods for preparing 
novel positive polypeptide variants. 



SUMMJiRY OF THE IKVENTION 

The object of the present invention is to provide an improved method 
■ for preparing positive polypeptide variants by an in vivo 
recombination method. 

The inventor of the present invention have surprisingly found that 
such positive polypeptide variants may advantageously be prepared by 
shuffling different nucleotide sequences of homologous DNA sequences 
by in vivo recombination comprising the steps of 

a) forming at least one circular plasmid comprising a DNA sequence 
encoding a polypeptide, 

b) opening said circular plasmid(s) within the DNA sequence (s) 
encoding the polypeptide (s) , 

c) preparing at least one DNA fragment comprising a DNA sequence 
homologous to at least a part of the polypeptide coding region on at 
least one of the circular plasmid (s), , d) introducing at least one 
of said opened plasmid (s), together with at least one of said 
homologous DNA fragment (s) covering full-length DNA sequences 
encoding said polypeptide (s) or parts thereof, into a recombination 
host cell, 

e) cultivating said recombination 'host cell, and 

f) screening for positive polypeptide variants. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the yeast expression plasmid pJS026 comprising DNA 

sequence encoding the HuraicoJa lanuginosa lipase gene. 

Figure 2 shows the yeast expression plasmid pJS037, comprising DNA 

sequence encoding the Humicola lanuginosa lipase gene containing 

twelve additional restriction sites. 

Figure 3 shows the plasmid pJS026. 

Figure 4 shows the plasmid pJS037. 

Figure 5 shows the in vivo recombination of the 0.9 kb synthetic 
wild-type Humicola lanuginosa lipase with pJS037 using Saccharomyces 
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Figure 9 shows an overview of . 
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pria^rs used for the PCR fra™!! ° '^^^'^ The 
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for preparing Posi tivrpolre-tld!" '° °" ^P"^"""" »a">od 

recon-bination method. ' * """"" an iterative in vivo 

The inventor of the 

efficient method for shlffUn'TT '"^^ -^singly fc„„, „ 

Vivo reco^ination system usin/ a ^a"'"" ^" " ^» 

host cell., ^ ^ eukaryotic cell as a ,-0^- w 

^ recombination 
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A "recombination host cell" is in the context of the present 
invention a cell capable of mediating shuffling of a number of 
homologous DNA sequences. 

The term "shuffling" means recombination of nucleotide sequence (s) 
between two or more homologous DNA sequences resulting in output DNA 
sequences (i.e. DNA sequences having been subjected to a shuffling 
cycle) having a number of nucleotides exchanged, in comparison to 
the input DNA sequences {i.e. starting point homologous DNA 
sequences) . 

An important advantage of the invention is that mosaic DNA sequences 
with multiple replacement points or replacements, not related to the 
opening site, is created, which is not discovered in Pompon's 
method. 

An other important advantage of the present invention is that when 
using a mixture of fragments and opened vectors (in the screening 
set up) it gives the possibility of many different clones to 
recombine pairwise or even triplewise (as can be seen in a couple of 
examples below) . 

The in vivo recombination method of the invention simple to perform 
and results in a high level of mixing of homologous genes or 
variants. A large number of variants or homologous genes can be 
mixed in one transformation. The mixing of improved variants or wild 
type genes followed by screenings increases the number of further 
improved variants manyfold compared to doing only random 
mutagenesis. 

Recombination of multiple overlapping fragments is possible with a 
high efficiency increasing the mixing of variants or homologous 
genes using the in vivo recombination method. An overlap as small as 
10 bp is sufficient for recombination which may be utilized for very 
easy domain shuffling of even distantly related genes. 

The invention relates to a method for preparing polypeptide variants 
by shuffling different nucleotide sequences of homologous. DNA 
sequences by in vivo recombination comprising the steps of 
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a) forming at least one circular plasMH 

encoding a polypeptide, c°ntprxsing a dna sequence 

elorg'l ^^-^^^ the se,uence(s) 

c) preparing at least one DNA fr,r™, 

h^olo^ous to ua« a part o. ZZ'l^T;'"" ' 

least one o£ the circular pUs^^.l^ 'TT. °» « 

ho.olo,cus OKA fra,™ent,», covering " °' -id 

" -^■'O"^ said poI„eptiae,s, or parts the" J ""^ ~ 

cell. "to a recombination 

e) cultivating said recombination host cell and 
' POl^ptide varL;ts 

" - - =.cle o. step a, to . 

The opening of the plasmid{s) in steo 

-te Within the polypeptide codino ' ^^"^ 

2° PWdCs, ^y ,e opened by „y 3u"^^ Plas^d. J 

-he opened ends of the Plas«id Jy ^ Iirr^'^ art. 
described i„ po„po„ et al. a9B9) ^"^^^'^"^ with nucleotides as 
- i« opened ends as it^^Z' c:r a ^^^r^ 
It is preferred to open the r,i 

polypeptide coding ^^^ seJ^IJ^^TZ^^^^^^^^^^ - the 

a -re elective recombination between '^''^^^^ "-^^ 

Plasmad(s). ^^^"«en dna fragment (s) and opened 

^° In an embodiment of the • 

prepared under conditions resrtrrin?/"* ^"'™«<»' i^lare, 
"tagenesis frequency. ' " ' ■»«^'» or high random 

3S - ""^atnTrr;;.^^^^^^^^^^ « i~<=. .comprising 
'^I'- <I988), Science 239, 487 - 

A medium or high mutagenesis fr.rr, 
° -e PCH ampUfieatio: Zr ZZllT/ I ""^"^^ — ^"g 
-corporation of nucleotides, Jr tn/ri ' the mis- 

-stance as described by Deshler, 
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(1992), GATA 9(4), 103-106; Leung et al., (1989), Technique, Vol. 1 
No. 1, 11-15. 

It is also contemplated according to the invention to combine the 
PGR amplification (i.e. according to this embodiment also DNA 
fragment mutation) with a mutagenesis step using a suitable physical 
or chemical mutagenizing agent, e.g., one which induces transitions, 
transversions, inversions, scrambling, deletions, and/or insertions.' 

In the context of the present invention the term "positive poly- 
peptide variants" means resulting polypeptide variants possessing 
functional properties which has been improved in comparison to the 
polypeptides producible from the corresponding input DNA sequences. 
Examples, of such improved properties can be as different as e.g. 
biological activity, enzyme washing performance, antibiotic resis- 
tance etc. 

Consequently, which screening method to be used for identifying 
positive variants depend on the desired improved property of the 
polypeptide variant in question. 

If, for instance, the polypeptide in question is an enzyme and the 
desired improved functional property is the wash performance, the 
screening in step f) may conveniently be performed by use of a 
filter assay based on the following principle: 

The recombination host cell is incubated on a suitable medium and 
under suitable conditions for the' enzyme to be secreted, the medium 
being provided with a double filter comprising a first protein- 
binding filter and on top of that a second filter exhibiting a low 
protein binding capability. The recombination host cell is located 
on the second filter. Subsequent to the incubation, the first filter 
comprising the enzyme secreted from the recombination host cell is 
separated from the second filter comprising said cells. The first 
filter is subjected to screening for the desired enzymatic activity 
and the corresponding microbial colonies present on the second 
filter are identified. 

The filtex used for binding the enzymatic activity may be any 
protein binding filter e.g. nylon or nitrocellulose. The topfilter 
carrying the colonies of the expression 'organism may be any filter 
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that has no or low affinity for h- ^• 
acetate or DuraporeO. The filter L L "' 

conditions to be used for screenZ o "'^-'^^^^^^ "^^^ -y of the 
detection of enz^onatic activity "^^^^^ 

The enzj«,tic activity may be detected h 

P«cipltati„n, pH indicator, IR-abrr, ' """"""ce, 
tecKni^e .or detection o. en^ Jtic a^;. °' 

10 The detecting compound „«y be i™,obiUzed bv anv • 

'■S. agarose, agar, gelatine, polyacrylald ™=""^ing agent 

cicth, or any combination o. i^obuSrate^tr""*' 

If the improved functional property of 
« sufficiently good after one cydrof .s P°^">=P"<ie is not 

be .Objected to another cycle "'^""""S, the polypeptide may 

m an embodiment of the invention .f i 

bac.croa.ing cycle with the initiallv ' 
20 be the „ild-type Dm fragment This , ' ""^ ""i^" 

"ons. Non-essential mutatZa Jy "./o ^''-"-""al ~ta- 

'^e 0» fragmenta aa the ^^..Z'^^^rtC rte^ar' 

« - -"^ta^a^l^^^^^^^^^^^^^ - - - suitable 

O'^^'ses. ' ""yl"", celluiaaes, peroxidases and 

Also contemplated accordino ^n »k • 

30 biological activity such a' ins^i^'rf" ^ "-l-g 
somatotropin, thymosin, parathyroid K ^""^'"statin, 
somatomedin. erythropoietin T„! P''"™'"/ hormones, 

gonadotropin, hypothalLc releasing fact""' """"""^ 
thyroid stimulating hormone, rela'in i'^ r"'"'""^ 

35 (TPO) and prolactin. "-'"feron. thrombopoietin 

Especially cont«„piated according to the 

inztxally to use input Dm sequences bL^ ■ T^^"' " 
or modified Dm seguences, such as a " 
typa, variant or modified en.™es r. 

•nzymes exhibiting lipcij.,i, '^^^■''"'^ly. in particular 
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In an embodiment of the invention the lipolytic activity is a lipase 
activity derived from the filamentous fungi of the Humicola sp., in 
particular Humicola lanuginosa, especially Humicola lanuginosa. 

In a specific embodiment of the invention the initially used input 
DNA fragment to be shuffled with a homologous polypeptide is the 
wild-type DNA sequence encoding the Humicola lanuginosa lipase 
derived from Humicola lanuginosa DSM 4109 described in EP 305 216 
(Novo Nordisk A/S) . 

Also specifically encompassed by the scope of the invention is input 
DNA sequences selected from the group of vectors (a) to (f) and/or 
DNA fragments (g) to (aa) coding for Humicola lanuginosa lipase 
variants from the list below in the Material and Method section. 

Throughout the present application the name Humicola lanuginosa has 
been used to identify one preferred parent enzyme, i.e. the one 
mentioned immediately above. However, in recent years H. lanuginosa 
has also been termed Thermomyces lanuginosus (a species introduced 
the first time by Tsiklinsky in 1989) since the fungus show 
morphological and physiological similarity to Thermomyces 
lanuginosus. Accordingly, it will be understood that whenever 
reference is made to H. lanuginosa this term could be replaced by 
Thermomyces lanuginosus. The DNA encoding part of the 18S ribosomal 
gene from Thermomyces lanuginosus (or H. lanuginosa) have been 
sequenced. The resulting 18S sequence was compared to other 18S 
sequences in the GenBank database and a phylogenetic analysis using 
parsimony (PAUP, Version 3.1.1, Smithsonian Institution, 1993) have 
also been made. This clearly assigns Thermomyces lanuginosus to the 
class of Plectomycetes, probably to the order of Eurotiales. 
According to the Entrez Browser at the NCBI (National Center for 
Biotechnology Information) , this relates Thermomyces lanuginosus to 
families like Eremascaceae, Monoascaceae, Pseudoeurotiaceae and 
Trichocomaceae, the latter containing genera like Emericella^ 
Aspergillus, Penicillium, Eupenicillium, P^ecilomyces , Talaromyces, 
Thermoascus and Sclerocleista. 



Consequently, such genes encoding lipolytic enzymes of filamentous 
fungi of the genera Emericeiia, Aspergillus, Penicillium, 
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Eupe,,,,,,,^^ p,,,,,^^^^^^^ ^^^^^ ^ 

Sclerocleista are also specific, n ^^^^oascus and 

present invention. -^'^^^^^--^ contemplated 

according to the 

5 Other examples of relevant- ^-^ 

strains listed in wo 96/13578 ^bsadia sp. e.g. the 

hereby incorporated ^ Zj ^^^^ "^^^ - 

96/13578 include ;^sidia TZlTelT ll' ""^"^ ^^"^^ WO 

Strains of Rhizopus so in « , 

-o conte^iate^ acc:;.:. t^^ ^ ^^^^^^^^ ^ or.ea are 
The lipolytic gene n^y also be derived fr. 

strain of the i>.eudo.o.as sp T °" ' as a 

stut^erl. PS. cepacia and Ps fiuZ ^ragi, Ps. 

Plsn^arU or ... (US 4,9^0 ^7?? .'"^ ^ 

Pseudoalcallgenes (ep 218 2-72 ep ,,, -^"^i^-es and P.. 

0 (disclosing variants of the Ps L.J , ' °' "° 94/25578 

the P.eudo«cnas sp. varrant Tsetlr""^^ ^^^^^^^^^ 
i'seudo^onas sp. lipolytic en.y.e sul \" ^ 
tenned Ps. p^tida, lipolytic e^^e d "^"^^^^ 
^^389 53S or variants^hLe:f as'Tescr/er ^^^^^^^^ ^ 

aurogino^a or Ps. ^la^ae. or Ps svrL "'^ 5,352,594, or Ps. 

96^2012 fro. solvay, or a ^t'rar of'; " ----i-n.i. mo 
su^tlUs described by Dartois et , ^• 
Biophysica acta H31, 253-260 '"^^^ Biochemica et 

e^muS92) or 5. 91/16422^ ^' t^'P 

^P" e.g. s. scabies/ or a str.in . ^ °^ 5trepto;nyces 

viscosuffl. °f ^rojnoZ^acteriun, sp. e.g c. 

In connection with i-v,.. « 

h-ology, such „ at least eoTT , ' ^«srae of 

^-ily Of Upases: P. rc«™T " 
conmerciaUy available as uZJl sp. lipase 

- .s. p«..oaieaii„LTi;::!',;irr'"'" ^ 

' <:epacia DSM 3959 
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Ps. cepacia M-12-33, Ps. sp. KWI-56, Ps. putida IFO 3458, Ps. pvtlda 
IFO 12049 (Gilbert, E. J., (1993), Pseudomonas lipases : Biochemical 
properties and molecular cloning. Enzyme Microb. Technol., 15, 634- 
645) • The species Pseudomonas cepacia has recently been reclassified 
as Burkholderia cepacia, but is termed Ps, cepacia in the present 
application. 

Also genes encoding lipolytic enzymes from yeasts are relevant, ans 
include lipolytic genes from Candida sp., in particular Candida 
rugosa, or Geotrichum sp., in particular Geotrichum candidum. 

Specific examples of microorganisms comprising genes encoding 
lipolytic enzymes used for commercially available products and which 
may serve as donor of genes to be shuffled according to the 
invention include Humicola lanuginosa, used in Lipolase®, Lipolase® 
Ultra, Ps. mendocina used in Lumafast®, Ps. alcaligenes used in 
Lipomax®, Fusarixm solani. Bacillus sp. (US 5427936, EP 528828), 
Ps. mendocina f used in Liposam®. 

Also the Pseudomonas sp. lipase gene shown in SEQ ID NO 14 are 
specifically contemplated according to the invention. 

It is to be emphasized that genes encoding lipolytic enzyme to be shuffled 
according to the invention may bfe any of the above mentioned genes of 
lipolytic enzymes and any variant, modification, or truncation thereof. 
Examples of such genes which are specifically contemplated include the 
genes encoding the enzymes described in WO 92/05249, WO 94/01541, WO 
94/14951, WO 94/25577, WO 95/22615 and a protein engineered lipase variants 
as described in EP 407 225; a protein engineered Ps. mendocina lipase as 
described in OS 5,352,594; a cutinase variant as described in WO 94/14964; 
a variant of an Aspergillus lipolytic enzyme as described in EP patent 
167,309; and Pseudomonas sp, lipase described in WO 95/06720. 

A request to the DNA sequences, encoding the polypeptide (s) , to be 
shuffled, is that they are at least 60%, preferably at least 70%, 
better more than 80%, especially more than 90%, and even better up 
to almost 100% homologous. DNA sequences being less homologous will 
have less inclination to interact and recombine. 

It is also contemplated according to the invention to shuffle parent 
(homologous) wildt type organisms of different genera. 



30 



35 



40 



WO 97/07205 

^2 PCT/DK96/0G343 
Further, the DNA fragment (s) to be shuffl h 

length of f.c. about 20 bp to 8 ^ Tf ' P-^-ably have a 
-re preferred about 80 bp to 4 ' ^° ^ 

• The method Of the invention is very eff • • 
lypept.de variants in comparison to ' Preparing po- 

The inventor found that i-ho .h 

open., pus^, Tz%"::^:r::r "'^^-^^ ^ — 

*en tra„,J„^„, a pXa^^, curat e" "'""'^^""^ "^"^ 

•f«r only ^^^^J^ ««^.„ed DH* se^.„ces were found 

A= described in Exanple 1 sot , 

contained reco*i„ed Dm se^encea of '^^for^nta 

20, Of .K, total n!L,er Of ret " 
«=hanged) . ' ""^ one region of nucleotides 

Iha input Dm sequences may be anv dm. 

ONA =.,u.„=es. Dm sequences enco? "'"""^ ^"=^"^"5 "ild-type 
«dmcations thereof. sTch as e««d^' « "«"-s. "Ir 

-d ^y also be the outcome of Dm set 
" °- -re cycles of shufmnrTi 

according to the „«thod of the llvenl °™ -^"ences, 

- o^ the metho., .scribed i" Tpt: sTt Z^Z' ""''"^ 
When using the method of th. ■ 

"■""^'O »A sequence" harnr ~ 
exchanged.. This results m reWacr ^ ' """^'^ °' ""ciectidefs) 
"ithin the polypeptide vari Jt ™ 1°' " °- -^"0 aci 

Po^^^eptide. It is to be underitoi: ^^^^ - 

Silent inutations is 
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contemplated (i.e. nucleotide exchange which does not result in 
changes in the amino acid sequence) . 

However, the method of the present invention will in most cases lead 
to the replacement of a considerable number of amino acid and may in 
certain cases even alter the structure of one or more polypeptide 
domains (i.e. a folded unit of polypeptide structure). 

According to the present invention more than two DNA sequences are 
shuffled at the same time. Actually any number of different DNA 
fragments and homologous polypeptides comprised in suitable plasmids 
may be shuffles at the same time. This is advantageous as a vast 
number of quite different variants can be made rapidly without an 
abundance of iterative procedures. 

The inventor have tested the nucleotide shuffling method of the 
invention using significantly more than two homologous DNA 
sequences. As described in Example 2 it was surprisingly found that 
the method of the invention advantageously can be used for 
recombining more than two DNA sequences . 

One cycle of shuffling according to the method of the invention may 
result in the exchange of from 1 to 1000 nucleotides into the opened 
plasmid DNA sequence encoding the polypeptide in question. The 
exchanged nucleotide sequence (s) may be continuous or may be present 
as a number of sub-sequences within the full-length sequence (s). 

To support the present inventioti the inventor made a number of 
additional experiments on different aspect on the method of the 
invention. The experiments are described below and illustrated in 
the Example 3 to 6 below. 

A number of vectors and fragments comprising an inactivated 
synthetic Humicola lanuginosa lipase genes were constructed by 
introducing f rameshif t/stop codon mutations in the lipase gene at 
various positions. These were used for monitoring the in vivo 
recombination of different combinations of opened vector (s) and DNA 
fragments. The number of active lipase colonies were scored as 
described in Example 3. The number of colonies determines the 
efficiency of the opened vector (s) and fragment (s) recombination. 
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One frameshift mutation in said ff„m,- , 

the openea vector a„. anothe „te ,^^^7^-" ^^P-» »e„. 
ot the opening site gave 3 to 32, of ao. °" 
depending on the location and colonies 
5 the Close, that the .uta is^Z " " 

higher mixing. °f the vector the 

One frameshift mutation in th. ^ 

fragment on each side of L o .eT " 
" colonies depending on the tcat:n ' " « 

ective colonies can be considered tot r^""'""- °' 
to the opening site. "nosaics, not only related 

" :«r:itinrrt:: :::g::nr - - the 

colonies depending on the loLuTaJTJ:' '° « "'ive 

active colonies are mosaics of'tte CenT::""- 

- ^^i::^:^:::-^ — - sid, of the 

colonies depending on the " « '"ive . 

It was also found that- t-K^ 

j-wuxiu tnat the amount of rr^^4~ 

- 2^'-- - - --nts are ri^^Xn^^tl^^^^^^^^^^ 

ceTs^leTthat"™; ^^^^^-^ as the recomhination host 
type Plasmid(s) and e>cpressed transformed very well with wild 

^ut gave no transforman'ts fal .rrtr 
30 fragments. ^''^ opened vectors and 

The RAD52 function is required for • 

not for unequal sister>strand mii.o. • reconU,ination" (but 

t.e recombination of opened iLr anV:'""''""'^"^ ^^^''^"^ 
classical recombination 

mechanism involve a 

Classical recombination is the r..« w 

the recombination between genes loir^T'"" ^"^^^-^ - 

homologous chromosomes as le^nerf "onsister chromatids of 

^0 HE and Sjonington LS ..Recoil . '""^^ ^^'--^ 
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Volume 1 (eds. Broach JR, Pringle JR and Jones EW) , Cold Spring 
Harbor Laboratory Press, New York. 

Multiple partially overlapping fraqements 

The inventor also tested recombination of multiple partial 
overlapping fragments using the method of the invention. 

The recombination of 2 and 3 partial overlapping fragments into a 
gapped (i.e. that the opening result in cutting out of a little 
part of the gene) vector were tested and gave a high recovery of 
recombined Humlcola lanuginosa lipase gene. The recovery of active 
lipase gene from different combinations of inactivated Humicoia 
lanuginosa genes was tested for the recombination of 2 partial 
overlapping fragments. The tendency was a higher mixing in the 
overlapping region between the 2 fragments in the gapped region 
than in the vector and fragment overlap. 

When recombining many fragments from the same region, the multiple 
overlapping fragment technique will increase the mixing by itself, 
but it is also important to have a . relative high random mixing in. 
overlapping regions in order to mix closely located 
variants/differences. 

An overlap as small as 10 bp between two fragments were found to be 
sufficient to obtain a very efficient recombination. Therefore, 
overlapping in the range from 5 to 5000 bp, preferably from 10 bp to 
500 bp, especially 10 bp to 100 bp is suitable according to the 
method of the invention. 

According to this embodiment of the present invention 2 or more 
overlapping fragments, preferable 2 to 6 overlapping fragments, 
especially 2 to 4 overlapping fragments may advantageously be used 
as input fragments in a shuffling cycle. 

Besides increasing the mixing of genes, this is a very useful 
method for domain shuffling by creating small overlaps between DNA 
fragments from different domains and screen for the best 
combination. 

For instance, in the case of three DNA fragments the overlapping 
regions may be as follows: 
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- the first end of the first f^^^ ^ 

opened pUs.^,, """'^^^ '"^ *i«t end ot the 

- the first end Of the second fragment oveiUps 

the frrst fragment, and the second end of 1 °^ 
5 overlaps the first end of the third fragment '"'^^ 

- the first end of the third frag,,»nt overlaps ,a, 
second end of the second fragment, and th sec „1 
fragment overlaps the second end of the opened pias^^ 

" "ar^'in: ~tl ^eferd ^ " « — « 

ends of the piasmid a"d l"f ^^nr """^ — 

" r f-— = =srarZer:ird.T — 
ptiir— - 0:= £=' r 

polypeptides. However, in such case it 1. encoding 
Plasmids at different sites compulsory to open the 

In an further embodiment of i->,«. • . 
Plasmids and one or more homologous ~:LT ""^ 
starting material to be shufflL C "'^ "^^-^ « the 

Pl.smid<s) and homologous OHA fraom. !,? """"" "P^"''' 
range from 20:1 to 1:S0. pre^abie^r ^/'i tTlT' ^" ""^ 

fragments) with the specific r-o.^ ^° ^-'lO (raol vector:mol 

Of the DNA. concentrations being from 1 pM to 10 M 

The opened plasmids may advantagously be aapoed • 
the overlap between the fragments is deleted Tn ^ ' 
to select for the recombination) . ^ """^"^ ^" 
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Preparing the DNA fragment 

The DNA fragment to be shuffled with the homologous polypeptide 
comprised in an opened plasmid may be prepared by any suitable 
method. For instance, the. DNA fragment may be prepared by PGR 
amplification (polymerase chain reaction), as described above/ of a 
plasmid or vector comprising the gene of the polypeptide, using 
specific primers, for instance as described in US 4,683,202 or Saiki 
et al., (1988), Science 239, 487 - 491. The DNA fragment may also be 
cut out from a vector or plasmid comprising the desired DNA sequence 
by digestion with restriction enzymes, followed by isolation using 
e.g. electrophoresis. 

The DNA fragment encoding the homologous polypeptide in question laay 
alternatively be prepared synthetically by established standard 
methods, e.g. the phosphoamidite method described by Beaucage and 
Caruthers, (1981), Tetrahedron Letters 22, 1859 - 1869, or the 
method described by Matthes et al., (1984), EMBO Journal 3, 801 - 
805. According to the phosphoamidite method, oligonucleotides are 
synthesized, e.g. in an automatic DNA synthesizer, purified, 
annealed, ligated and cloned in suitable vectors. 

Furthermore, the DNA fragment may be of mixed synthetic and genomic, 
mixed synthetic and cDNA or mixed genomic and cDNA origin prepared 
by ligating fragments of synthetic, genomic or cDNA origin (as 
appropriate), the fragments corresponding to various parts of the 
entire DNA sequence, in accordance with standard techniques. 

The plasmid 

The plasmid comprising the DNA sequence encoding the polypeptide in 
question may be prepared by ligating said DNA sequence into a 
suitable vector or plasmid, or by any other suitable method. 

Said vector may be any vector which may conveniently be subjected to 
recombinant DNA procedures. The choice of vector will often depend 
on the recombination host cell into which it is to be introduced. 

Thus, the vector may be an autonomously replicating vector, i.e. a 
vector which exists as an extrachromosomal entity, the replication 
of which is independent of chromosomal replication, e.g. a plasmid. 
Alternatively, the vector may be one which, when introduced into the 
recombination host cell, is integrated in'to the host cell genome and 
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replicated together with the chromosome (s) into k- 

integrated. ^"^o which it has been 

TO facilitate the screening process it is pref.r . 
5 IS an expression vector in which the ONA s " '''' 
polypeptide in ^estion is operably UnKed to 

required for transcription of the dna r! ^''^'^'^^^^ segments 

vector is derived from a piasmid a .'J . ^^P^ession 
»ay contain elements of any'or Tn of theT ^ or 

The term, "operably linked" indicates that th. 

so that they function in concert for thel ' n^Tr ^^""^^^ 
transcription initiates in a promoter and P^^Po^es, e.g. 

,3 ^^'"^"^^ ^'^e polypep'tide L ^3^;."'^ 

The promoter mav be ^nt/ min 

"lay oe any DNA seguence whii-u i_ 

activity in the recombination host cell of t riptional 
from genes encoding proteins su.h . ^''''^ ^"""^ "^^ ^^'i^ed 

heterologous to the host cell ^'^'^^ homologous or 

20 

Examples of suit^i^ promoters for use i„ „, . . 
pro„K,ters fro. yeast glycolytic ,e„,s (Hitz ""'"^'^ 
Ch».. 255, 12073 - 12080, Alber and . " " •^•..<1980,, J. 
«PP1- Ge„. 1. <19 - 434) or alc=h!l TJ^ J- ""l- 

" i» Genetic Engineer „; of „ " 

IHollaender et ,1, eds.,, flenl Press r ^""^ ""^"''^ 
654) pron^ters. '""""U et al., ,1983,, ^^^^^ ^^^^ _ 

30 Ex«ples Of suitable promoters for use in fi, 

cells are, for inst«,ce, the ADH3 .rl l '^^^entous fungus host 
The E«BO J. 4, 20S3 - 20«, „ ' " « "l- (1985), 

useful promoters are those derived from T"''" °' 
TAKA amylase, ;?i,izomucor mielei . '"^''^'""^ ^- --^"^ ■ 

35 tral a-amylase, ./.L^^^d staMe"' "^^^ 

alkaline protease, A. oryzae trio.. ""f^"' -^^P^^^' A- oryzae 
niduians acetamidase. PreferL. '^^""^^-^^ - ^• 

promoters.. ^^"^"^ ^" ^^e TAKA-amylase and gluA 
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The DNA sequence encoding polypeptide in question invention may 
also, if necessary, be operably connected to a suitable terminator; 
such as the human growth hormone terminator (Palmiter et al., pp, 
at. ) or (for fungal hosts) the TPIl (Alber and Kawasaki, o£^ cit.) 
or ADH3 (McKnight et al., o£^ cit. ) terminators. The vector may 
further comprise elements such as polyadenylation signals (e.g. from 
SV40 or the adenovirus 5 Elb region), transcriptional enhancer 
sequences (e.g. the SV40 enhancer) and translational enhancer 
sequences (e.g. the ones encoding adenovirus VA RNAs) . 



The vector may further comprise a DNA sequence enabling the vector 
to replicate in the recombination host cell in question. 
When the host cell is a yeast cell, suitable sequences enabling the 
vector to replicate are the yeast plasmid 2m replication genes REP 
15 1-3 and origin of replication. 

The plasmid pYl can be used for production of useful proteins and 
peptides, using filamentous fungi, such as Aspergillus sp., and 
yeasts as recombinant host cells ( JP06245777-A) . 

20 

The vector may also comprise a selectable marker, e.g. a gene the 
product of which complements a defect in the recombination host 
cell, such as the gene coding for dihydrofolate reductase (DHFR) or 
the Schizosaccharomyces pombe TPI gene (described by P.R. Russell, 
25 (1985), Gene 40, 125-130). • 

Another example of such suitable selective markers are the ura3 and 
leu2 genes which complements the* corresponding defect genes of e.g. 
the yeast strain Saccbaromyces cerevisiae YNG318. 

The vector may also comprise a selectable marker which confers 
resistance to a drug, e.g. ampicillin, kanamycin, tetracyclin, 
chloramphenicol, neomycin, hygromycin or methotrexate. For fi- 
lamentous fungi, selectable markers include amdS , pyrG , arqB , niaP , 
35 sC^ trpC , pyr4 , and DHFR . 

To direct the polypeptide in question into the secretory pathway of 
the recombination host cell, a secretory signal sequence (also known 
as a leader sequence, prepro sequence or pre sequence) may be 
40 provided in the recombinant vector. The secretory signal sequence is 
joined to the DNA sequence encoding the lipolytic enzyme in the 
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correct reading frame. Secretory sianal 

positioned 5. to the DNA sequence encodi '^^"^^"^^ 
secretory signal sequence .ay Z the sT" , ' ^^^yPePtide. The 

the polypeptide in question or .ay b f"! ""^''^ "^^^ 
5 secreted protein. '""^ ^ ^^"e encoding another 

The signal peptide may be naturally occurring • 
functional part thereof, or it Jy le ! 'T"' ^ ^ 

secretion from yeast cells, suitable sian.f P^Ptide. For 

10 to be the a-factor signal .e^.TJ^ TjTTr '''' 

peptide Of .ouse salivary amylase fcf o ^^^"^^ 
(1981), Nature 289, 643-646, " "^^^'^"chle et al 

peptide ,cf. .... 'JZT:.: i^:;r rr~"^ - - 

«-.coi. , ' ^ISS^^^^^^ cell 48, 387-897), 

^5 peptide (cf. wo 87/02670), or the yeast ' ^ ^^^-^ 

"gnal peptide (cf. „. Egel-Mitani et T'^'"'' ' f^^") 

137). g^Mitan. et al., a990), yeast 6, 127- 

For efficient secretion in yeast . 
0 peptide may also be inserted downs'tream 0^?." ^ 
upstream of the DKA sequence encodi" ^^•^^"^ 
The function of the Teadel pept^,! ^^^^^Ptide in question, 

polypeptide to be directed frfm th! T ^-P"««^ 

Oolgi apparatus and further to a sec T^ """' ^° 
into the culture medium U.e export"" ^«««t:ion 
the cell wan or at lea t throTl tt" n ^^^^^P^^^e across 
Periplas.-nic space of the yeast ceU) Th "^""""^ ^he 

yeast a-factor leader (the use of whl ! ""^^ Peptide may be the 
4^546,082, HP 16 201, ep 123 294 1 '^^"^^^^ ^ 

Alternatively, the leader peptide mav hi ^""^ • 

Which is to say a leader p^ pt^rLr / deader peptide, 

leader peptides may, for instancl h Synthetic 
89/02463 or WO 92/11378. constructed as described in WO 

For use in filamentous funoi ^h« • 

protease, a Hu.l.oU 2..uginL 1 « 
preferably .e.i.,, a™„=i, "''"'^ ^'^'"^ 

j^-i-aae, or A. /iiger 
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glucoamylase . 

The recombination host cell 

The recombination host cell, into which the mixture of plas- 
5 itiid/f ragment DNA sequences are to be introduced, may be any 
eukaryotic cell, including fungal cells and plant cells, capable of 
recombining the homologous DNA sequences in question. 



According to prior art prokaryotic microorganisms, such as bacteria 
10 including Bacillus and E. coii; eukaryotic organisms, such as 
filamentous fungi, including Aspergillus and yeasts such as 
Saccharomyces cerevisiae; and tissue culture cells from avian or 
mammalian origins have been suggested for in vivo recombination. All 
of said organisms can be used as recombination host cell, but in 
15 general prokaryotic cells are not sufficiently effective (i.e. does 
hot result in a sufficient number of variants) to be suitable for 
recombination methods for industrial use. 

Consequently, preferred recombination host cells according to the 
20 present invention are fungal cells, such as- yeast cells or filament- 
ous fungi. 

Examples of suitable yeast cells include cells of Saccharomyces sp., 
in particular strains of Saccharomyces cerevisiae or Saccharomyces 

25 kluyveri or Schizosaccharomyces sp.. Methods for transforming yeast 
cells with heterologous DNA and producing heterologous polypeptides 
therefrom are described, e.g. in US 4,599,311, US 4,931,373, US 
4,870,008, 5,037,743, and US 4,845,075, all of which are hereby 
incorporated by reference. Transformed cells may be selected by, 

30 e.g., a phenotype determined by a selectable marker, commonly drug 
resistance or the ability to grow in the absence of a particular 
nutrient, e.g. leucine. A preferred vector for use in yeast is the 
POtl vector disclosed in US 4,931,373. The DNA sequence encoding the 
polypeptide may be preceded by a signal sequence and optionally a 

35 leader sequence, e.g. as described above. Further examples of 
suitable yeast cells are strains of Kluyveromyces, such as K. 
lactis, Hansenula, e.g. H, polymorpha, or Pichia, e.g. P. pastoris 
(cf. Gleeson et al.,(1986), J. Gen. Microbiol. 132, 3459-3465; US 
4,882,279). 
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Examples of other fungal cells are cells of 

Aspergillus sp., Neurospora sp., Fusarll f^ngi, e.g. 

particular strains of or.." T ll:r 

Aspergillus sp. for the expression of p/oteLt' °' 
e.g., EP 272 277, EP 230 023. The tran.. " ^^^^cribed in, 

-ay, for instance, be carried out as 7"'"°"^'^°" °^ ^- o.ysporu. 
(1989), Gene 78, 147-156. 'iescrxbed by Malardier et al., 

In a preferred embodiment of the invprn-,- 

-11 is a cell Of the genus sZT "-"^--ion host 

cerevisiae. ^ Saccharomyces, in particular s. 
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DNA seq upn^o. 

Humicola lanuginosa DSM 4109 derltr^n i • 

derived l.pase encoding DNA sequence. 

Himlcola lanuginosa lipase variants: 
^^i£iants_used for 

Example^ ^ ^° o°en>>H .... .j, 

(a) ES6R,DS7L,190F,D96L,E99K 

(b, ^5^H,O57L,V60M,D62N,S83T,O96P,D102E 

(c) D57G,N94K,D96L,L97M 

(d) E87K,G91A,D96R,I100V,E129K K237M t,., 

(e) E56R, D57G, S58F, D62C, T64R e;7TgL^o^ ''''^ 

(f) E210K ' '^^'^^'^'^^1A'^95L,D96P,K98I,(K237M) 



5NA__frasirients 



variants „..^ 
amplificat:,-on in F.v.n.pi, o. 

(g) S83T,N94K,D96N 

(h) E87K,D96V 

(i) N94K,D96A 

(j) E87K,G91A,D96A 
(k) D167G,E210V 
(1) S83T,G91A,Q249R 
(m) E87K,G91A 

(n) S83T,E87K,G91A,N94K,D96N,D111N 

(O) N73D,E87K,G91A,N94I,D96G 

(P) "^^'^^6V,S83T,E87N,I90N,G91A,D96A,K98R 



_standard ppp 
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(q) S83T, E87K, G91A, N92H, N94K, D96M 

(s) S85P,E87K,G91A, D96L,L97V. 

(t) E87K,I90N,G91A,N94S/D96N,I100T, 

(u) I34V, S54P, F80L, S85T, D96G, R108W, G109V, DlllG, S116P, L124S/ 

5 V132M,V140Q,V141A,F142S,H145R,N162T,I166V,F181P,F183S, 
R205G, A243T,D254G,F262L. 
(V) E56R,D57L,I90F,D96L,E99K 
(X) E56R,D57L,V60M,D62N,S83T,D96P,D102E 
(y) D57G,N94K,D96L,L97M 
10 ( z) E87K, G91A, D96R, IIOOV, E129K, K237M, I252L, P256T, G263A, L264Q 
(aa) E56R,D57G,S58F,D62C,T64R,E87G,G91A,F95L,D96P,K98I 

Strains: 

Expression system host: 

Saccharomyces 'cezevisiae YNG318: MATa Dpep4[cir*] ura3-52, ieu2-D2, 
his 4-539 

Saccharomyces cerevlslae Rad52: Strain M1533 = MATa rad52 uraJ, 
obtained from Torsten Nilsson Tillgren, Institute of Genetics, 
University of Copenhagen. 

Plasmids: 

pJS026 (see figure 3) 
pJS037 (see figure 4) 
pYES 2.0 (Invitrogen) 

Transformation selective marker 
ura3 
leu2 

30 Media 

SC-ura": 90 ml 10 x Basal salt, 22.5 ml 20% casamino acids, 9 ml 1% 
tryptophan, H2O ad 806 ml, autoclaved, 3.6 ml 5% threonine and 90 ml 
20% glucose or 20% galactose added. 

LB-medium: 10 g Bacto-tryptone, 5 g Bacto yeast extract, 10 g NaCl 
35 in 1 litre water. 

Brilliant Green (BG) (Merck, art. No. 1.01310) 

BG-reagent: 4 mg/ml Brilliant Green (BG) dissolved in water 

Substrate 1: 

10 ml olive oil (Sigma CAT NO. 0-1500) 
4 0 20 ml 2% polyvinyl alcohol (PVA) 
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The substrate is homogenised for 15-20 minutes. 
Methods : 



10 



construction of veast .vp.... 

The expression plasmxds pJS026 and pJS037 ..... 
2.0. The inducible GALl-promoter of pYES 2 o was ^'^^ 
constitutively expressed TPI (triose ohos . ""^^'""^^^ ^''"^ 

from S.ccharo:nyces cerevisiae (Albert and K -promoter 
-PPI Oenet., .1S-.3.) and T:^. ^.^::^: J"''' ' " 
restriction map of pJS026 and pJS037 is /ho ^ 
4, respectively. ^ " figure 3 and figure 
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* . . ' ^ . - . ■■ -^iav,)AUCiIL 

a suitable restriction e"Z °" 

sc ura medium is inoculated with . c 

and srovm at 30-c for 2 days The 10 n,v =«evlsiae colony 

- sc-ura- medium which is'^rcl , °3"e d'" '"'"^'""^ "° 

Mi«se"°" ' ' " tsuhs,"^/;,/"^- ^ ^= 

yeast extract (Difco) 
L-Leucin (Fluka) 

MgS04-7H20 

Trace compounds 
Vitamin solution 
H3PO, 

20% Pluronic (antifoam) 
In a total volume sqoq ^, , 

The yeast cells are fermented for 5 days at 30 

start dosage of 100 ml 70% ol„.«« / ^^^^ ^iven a 

PH-S.O is ,ept by Tdd t of Trr:?"" ^ 
rP" .or the first 22 hours Allowed by ;O rr°;- T"''" '"^ 
fermentation. Mr is ,ive„ with U air/lI„Tor /he''; 

lor the first 22 hours 



25 


400 g 




6.7 g 




12.5 g 




6.7 g 




10 g 


30 


17 g 




10 ml 




5 ml 




6.7 ml 




25 ml 
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followed by 1.5 1 air/l/min for the rest of the fermentation. 

Trace compounds: 

6.8 g ZnCl2 

54.0 g FeClz'SHzO 

19.1 g MnCl2'4H20 
2.2 g CuSOfl'SHjO 
2.58 g C0CI2 
0.62 g H3BO3 

0.024 g (NH4)6MO702r4H20 
0.2 g KI 

100 ml HCl (concentrated) 
In a total volume of 1 1. 

Vitamin solution; 

250 mg Biotin 

3 g Thiamin 

10 g D-Calciumpanthetonat 

100 g Myo-Inositol 

50 g Cholinchlorid 

1.6 g Pyridoxin 

1.2 g Niacinaraid 

0.4 g Folicacid 

0.4 g Riboflavin 

In a total volume of 11, 

Transformation of yeast 

Saccharomyces cerevisiae is transformed by standard methods, (cf. 
Sambrooks et al., (1989), Molecular Cloning: A Laboratory Manual, 
2nd Ed., Cold Spring HarjDor) 

Determination of yeast transformation frecruency 

The transformation frequency is determined by cultivating the 
transformants on SC-ura*plates for 3 days and counting the number of 
colonies appearing. The number of transformants per mg opened 
plasmid is the transformation frequency. 



Screening for positive variants with improved wash performance 

The following filter assay can be used for screening positive 

variants with improved wash performance. 
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Low calcium filter assay 
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1) Provide SC ura- replica plates (useful for , 
carrying the expression vector) with a fiJlt ""'^=^^^9 -trains 

5 (Nylon .eznbrane) and a second low proteL" 11"''!" '''''' 
acetate) on the top. ^""^^"^ filter (Cellulose 

2) spread yeast cells containing a parent Una, 

-Pase gene on the double filter an'd inc^.:re 0:%%^^ al 

10 3) Keep the colonies on the top filter bv ^ 

filter to a new plate. ^ transferring the top- 

4) Remove the protein binding filter to «n o 

5) Pour an agarose solution T ^ ^ ^"^'^ 

5 buffer PH9 and EaTk (fintr ^^"^-^^-'0.0041), loo tri 

30 as to identif^i: ^ ^ -tto. fUte" 

blue-green spots. «^P««:^ng l.pase activity in the forn of 

6) Identify colonies found in sten 51 k • 
^ for calciu. as compared to the parent lipasr' ' '^^^'^^^^^^ 

DMA seq uenri na was performed by usina anni • . 

sequence model 373A accordina to ^^o^ystems ABI dna 

Terminator Cycle Sequencing Jcit P^^ocol m the ABI Dye 

The number of colonies determines tie efficl 

vector and fragment recombination The n °^ 

active lipase activity .ives a^ ^'l^re^^^^^^^^^^^ with 

actzve and inactive genes - theoretically it L , °' 

one frameshif t that the closer to 50* 7. 1 "^^^^ated for 

Uicelihood Of wild type and framesh^^t 2I1 f"", "'^'"^ ^"^^^ 

12.5% for 3 frameshifts. ^^'^ ^ frameshif ts and 

Frameshif t mutaM-n» 

The frameshif t mutation were created • 

restriction site (in case of 5' T T " "'"^^^^ - 

ends" (in case of 3' overhana) b 7/ °' '^'"'"^ "^^^^^^V 

dNTP (deoxynucleotides : ' P^^V-erase with or without 

dGTPl. Method, for fUUn,TL °' -d 

on ri,„,e „ and defeti" '"^«red to 33 

"<■»■' on r.,„„ „ „ end. deferred to as 
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Method for assessing colonies with lipase activity 

The number of colonies and positives (i.e. with lipase activity) are 
calculated as the average of 3 plates. 

The cultivation condition and screening condition used is the 
following: 

1) Provide SC Ura-plates with a protein binding filter (Nylon 
filter) onto the plate. 

2) Spread yeast cells containing a parent lipase gene or a mutated 
lipase gene on the filter and incubate for 3 or 4 days at 30 *C. 

3) Remove the protein binding filter with the colonies to a petri 
dish containing: An agarose solution comprising an olive oil 
emulsion (2% PVA:01ive oil=2:l). Brilliant green (indicator, 0.004%) , 
100 mM tris buffer pH 9. 

5) Identify colonies expressing lipase activity in the form of blue- 
green spots. 

EXAMPLES 
Example 1 

Testing in vivo recombination of two homologous genes 

The Saccharomyces cerevisiae expression plasmid pJS026 was 

constructed as described above in the "Material and Methods"- 

section. 

A synthetic Humicola lanuginosa lipase gene (in pJS037) containing 
12 additional restriction sites Csee figure 4) was cut with Nrul, 
PstI, and Nrul and PstI, respectively, to open the gene 
approximately in the middle of the DNA sequence encoding the lipase. 

The opened plasmid (pJS037) was transformed into Saccharomyces 
cerevisiae YNG318 together with an about 0.9 kb wild-type Humicola 
lanuginosa lipase DNA fragment (see figure 1) prepared from pJS026 
by PGR amplification. 

Further, the opened plasmid was also transformed into the yeast 
recombination host cell alone (i.e. without the 0.9 kb synthetic 
lipase Dm fragment) . 



The transformed yeast cells were grown . as described in the "Ma- 
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terials and Method"-section above, and the tran.f 

was determined as described above. '^""sformation frequency 

It was found that the transformation frequencv of ^h 
alone was very low (lo transformants per ' ^ ' ^^^^'^^ 
comparison to the transformation frequency of / h '""'"'^^ ' 
(50,000 tran,formants per mg opened pfaUd,! ^^"-^^/^-^ment 

The Plasmid/fragment was PCR amplified resulti.a . ,n 

containing fragments covering the Upase oe 

recombined plasmid/f ragments . The recombTna\ion °' 

transfoxmants were analyzed by restr^lT °^ 20 

-an.a.d.etho,3. .he result is 'dispL^^^^^^^^^^^^ 



15 Table 1 



20 



25 P5 
P6 

Nl 
N2 

30 N3 
N4 
N5 
N6 

35 P/Nl 
P/N2 
P/N3 
P/N4 
P/N5 
P/N6 
P/N7 
P/N8 



40 



45 



50 



f R SphI Hindlll Pst/"""' is^^^Mh?^?' 

fragment ^^"^ ^^tXI Nhl BstEIl Kpnl Xhol 

PI 
P2 
P3 
P4 




nd: not detennin"! "^"""^"n enzyme pattern 

As can bee seen from Table 1 in f,.-.. 

1 10 transfoxmants (equivalent to 50%) 



wo 97/07205 



29 



PCT/DK96/00343 



contained recombined DNA sequences. 4 of these 10 DNA sequences 

(equivalent to 20%) contained either a region of the wild-type gene 

recombined into the synthetic gene or a region of the synthetic gene 
recombined into the wild-type fragment. 

Example 2 

In vivo recombina tion of Humicola lanuQinosa lipase variants 

The DNA sequences of 20 variants of the Humicola lanuginosa lipase 

were in vivo recombined in the same mixture. 

Six vectors were prepared from the lipase variants (a) to (f) (see 
the list above) by ligation into the yeast expression vector pJS037. 
All vectors were cut open with Nrul. 

DNA fragment of all 20 homologous DNA sequences (g) to (aa) (see the 
list above) were prepared by PGR amplification using standard 
methods. 



The 20 DNA fragments and the 6 opened vectors, were mixed and 
transformed into the yeast Saccharomyces cerevisiae YNGSIS by 
standard methods. The recombination host cell was cultivated as 
described above and screened as described above. About 20 trans- 
formants were isolated and tested for improved wash performance 
using the filter assay method described in the "Material and 
Me thods " -s ecti on . 

Two positive transformants (named 'A and B) were identified using the 
filter assay. 

In comparison to the wild-type amino acid sequence the two re- 
combined positive transformants had the following mutations. 

A: D57G, N94K, D96L, P256T 

A is a recombination of two variants. 
originates from the vector (d) 

==== originates from the DNA fragment prepared from variant (y) 

B: D57G, G59V, N94K, D96L, L97M, S116P, S170P, N249R 
???? ««< ????? 

B is a recombination of vector (c) , DNA fragments (n) and (u) . 
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35 



40 



originates from the vector (c) 
«<< originates fro. the DNA fragment prepared fro. variant (u) 

.??? Amino acid mutation which is not a result «f u 
5 result of recombination. 

As can be seen the resulting positive variant.; h» v 
recombination two or more variants The ' T '""^'^ 

are not a result of in li' ^'^^'^ mutations marked 

Shuffled lipase variants (s e^e lLI^rvr"^" """^ 
10 mutations. Consequently, these Ltat^ 2 TT"! '''' °' ^'^^ 
:autagenesis arisen during preparation of the °' 
standard PCR amplification. fragments by 

15 Example 3 



Synthetic Humlcola lanuqlnosa linase a^r,o / • 

or filli.,-i„ a^0/31,/sie/'4e restriction e„°" 

or by site-dir,=t.d introduction of a stop coder An 
=-«.etic iipase ,a.,. o. ,00 can >.ZZTZ\ZT;: . 

technique. Smaller PCR fr.™,! I standard PCR 

primer 4548 i2eZ> pJZTZl ."T 

op), primer 2843 and primer 4548 (488bp). 

I1-5 ml (app. 0.1 mg) of vectors Blue 425 Blu» 

Blue 429, opened with P.i- x , • ' ^^"^ 

ana Blue 425 opened 1th nL ^" "°" 

- 3 ml ,app^ 0.. :rorL:;:„tr4T:25"^n~ 
;"3": :^:erc^ra?dT::i^"" "° ----«"~e 

f cut cells as displayed in Table lA. 

The number of colonies and positives (i.e with Una, 

were calculated as the averaae of 3 Jll ^ activity) 

material and Methods section '"^'''^^ 

The result of the te«?t- i u . 

tne test is shown in Table 'lA 
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Table lA 



vector + Fr^ament 


colonies 


% of colonies with active 
lipase activity 


1. Blue 42S + 425d 


774 


16% 




049 


3% 




^ / 0 


25% 


4. Blue 425 + 426 


528 


18% 


5. Blue 425 /Nru I 
+ 426 


539 


23% 


6. Blue 425 + 424 . 


139 


7% 


1. Blue 424/NruI + 
425a 


74 


32% 


8. Blue 428 f 425 


81 


12% 


9. Blue 428 wt 
fragment 


317 


37% 



Pairwise recombinations of one frameshift mutation on the vector 
and another on the fragment on the opposite side of the opening 
site, n determined by 9 plates; # determined by 6 plates. 



The first 2 rows of Table lA displays vectors and fragments with a 
frameshift on each side of the PstI site. The "mirror image" 
experiment in row 2 compared to row 1 gives a reproducible lower 
number of active colonies. The same is true for row 3 and 4 even 
though it is not as pronounced. Moving the opening site closer to 
the frameshift in the vector increases the number of actives as 
seen in row 5. This can explain the reason for the difference in 
the ''mirror image" experiments. In both cases the higher number of 
positives has the opening site cl-oser to the frameshift in the 
vector . 

It can therefore be concluded that the closer the mutation is to 
the end of the vector the higher chance of mixing. This is probably 
arising from the well known fact that free DNA ends have a high 
recombinogenic potential . Therefore it is desirable to have as many 
free DNA ends as possible to increase the mixing of the genes. This 
is for example obtained in the later example with, recombination of 
multiple overlapping fragments. 

Row 6 has a. rather low number of actives probably due to the 
location of the frameshift on the fragment exactly at the PstI 
opening site of the vector. 
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15 



ROW 7 has the franeshift of the vector clo 

and again it gives a high number of actives'' ''"'''"^ '''^ 

efficiency of ::rr::::Z:'f ''''''' reco:n.i.ation 

the following experiments ".e 11 '^"r" ^^^^^^^^ stations 

The same way as described above 0 5 ml / 

^24, Blue 625 and Blue 626 (see L.r j 

comprising stop codons at specified o.. • "''^ 

and 7.6, respectively, (perparefb, site '''' 

together with 3 ^ Upp. 0^ .g' of fral"'' "-^^enesis, were 

transformed into 100 ml SaccJLlL ^"^'""^ ^^4, 625 and 626 

cens in .... combination^- — 



^^ector + 

Blue 626 
624 

^- -Blue 624 
626 

3* Blue 625 
624 

Blue 624 
625 



Number of 
colonies 

ND 



ND 



Table IB 



ND 



ND 



20 



=olo„i„ „uh Upa« activity i/cU."" m h"" °^ 
nutations cellared to the frameshlft . ' 

--:;.ttt\v:r^^^^^^^^^^^ »utatio„a. .io. i. 

fra^eshift mutations. R„„ 3 ° than 

^= .0 ..e en. 0. -^"^JHe ^ 



* °^ colonies t/itA lipase 
^^^^^^ ^ 

40% 



12% 



75% 



10% 
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and one or two frameshift mutations in the fragmen t 

Using the same approach as described above the influence of one or 
two frameshift mutations in the vector and one or two frameshift 
mutations in the fragment were tested using vectors Blue 425, 426 
and 428 (one mutation) and vectors Blue 442/ Blue 443 (two 
mutations) and fragments 442 and 443 (two frameshift mutations) 
and fragments 424, 425, 426, 427, 428 (one mutation) and wild-type 
(no mutation) 

The vectors Blue 442 and 4 43 are double frameshift mutations: Blue 
442 = 428+429 and blue 443 = 427+429 (see Figure 7). 

Recombination was performed by transforming 0.5 ml vector (app. 0.: 
mg) opened with PstI and 3 ml PCR-fragment (app. 0.5 mg) into 100 
ml Sacchromyces cerevisxae YNG318 competent cells. 

The result of the test is shown in Table 2A and Table 2B 



Table 2A 



Vector + 
Fragment 


Number of 
colonies 


.% of colonies with active 
Lipolase 


1. Blue 425 + 
442 


142 


15% 


2. Blue 425' + 
443 


144 


14% 


3. Blue 426 + 
442 


42 


42% 


4. Blue 426 -i- 
443# 


77 


20% 


5. Blue 428 + 
443 


115 


3.8% 



One frameshift mutation on the vector and two on the fragment on 
each side of the opening site. # determined by 6 plates. 



Table 2B 



Vector + Fragment 


Number of 
colonies 


% of colonies with active 
Lipolase 


Blue 442 + 424 


137 


0.5% 


Blue 442 + 426 


118 


1.1% 


Blue 442 + 427# 


125 


1,3% 


Blue 443 + 425 


540 


2.5% 
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15 



20 



Blue 443 + 426 
Blue 443 + 428 




on 



the fragment. # Itt^i^T^^T^^p^^^^P^^ 

Table 2A shows a rather high number of colonic • 
activity even with a total of . / «°lonies with lipase 

£ra«e=hlft on the vector ia located 

than u„e 3 pMa" rt~' f 

on the vector i. located further away tlJl.T ^-">«h«t 
the fra«e3hlft on the fra^ent „.Un, the active^ 
a« not related to the opening site (see , 

very low nu^er of actives are le JeT H '"^ ' ^ 

^ra^^eshifts located on the ve totX t oTthe''"^ ' 

are mosaics of the "parent" DH* colonies 

related to the opening sit ^^e IT. 

'raMlt ttt^r" "° - 

t-ne test is shown in Table 3 



Vector + Fragment 

Blue 428/pstI + 

Blue 429/pst # 

2^"!j28/pst + Blue 
429/PstI + 442 

Blue 442/pstI + 428 + 
429 

Blue 443/pstI + 427 + 
428 



Table 3 

Number of 
polonies 

13 



273 
228 
229 



* of colonies with active 
Lipol ase 

15% 



4.2% 
0.8% 



fe- T 1-6% 

. r « Of colonies are seen for the control e.peri„e„t 



25 
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two frameshifts each corresponding to the frameshift on each 
vector. Via a tripartite recombination 4.2% actives are created. 
With two fragments with each one frameshifts and a vector with the 
same two frameshifts very few actives are found. 

Recombination with vectors opened at different sites 

Opening the vector in one side instead of approximately in the 

middle still gives good recombination as shown in Table 4, Two 

vectors opened at different sites can also recombine to some extent 
(compare with the vector controls in table 13) . 



Table 4 



Vector + Fragment 


Nunber of 


% of colonies with active 




colonies 


Lipolase 


Blue 428/xho + 429 


160 


11% 


Blue 428/xho+Blue 


35 


6.3% 


429/pst# 





Opening of the vector in one side instead of in the middle. # 
determined by 6 plates. 



Recombination at different concentrations of vector and fragment 

The relative concentration of vector to fragment do influence the 
percentage of positive colonies as can be seen in Table 5. 



Table 5 



Vector + Fragment 


Number of 
colonies 


% of colonies with lipase 
activity 


0.5^1 Blue 426 •«■ 
3|xl 442 


42 


42% 


,1.5^1 Blue 426 + 
3^11 442 


21 


51% 


l.Sul Blue 426 + 
9nl 442 


34 


26% 


1.5^1 Blue 426 + 
3fil 427 


230 


2.8% 


1^1 Blue 442 + Ijil 
425 


224 


1.16% 


l^il Blue 442 + 2nl 


429 


0.9%, 
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varying tne couc.uL.aL.on ot the vectoi-^F^^S^si^ 



The size of the fragment also influences the recombin.i- 

as seen in Table 6. tne recombination result 



10 



15 



Table 6 




Transformation with unopened vectors sho..« 

recombination (Table 7). ^ """^ ^^^^^^ °f 



Table 7 



Plasmid 



Blue 428 + Blue 

4?Q 



Number of 
colonies 

887 



* of colonies with active 
Lipolase 

0.3% 
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Blue 426 + Blue 
425 



697 



0.7% 



Recombination of unopened plasmids. 



Example 4 

Test of 5. cerevisi ae mutants altered in recombinatjnn 

Using the same approach as described in Example 3 recombination of 
opened and unopened vectors and fragments were tested using a 
Saccharomyces cerevisiae rad52 mutant as the recombination host 
cell. The result is displayed in Table 8. 



Vector + 
Fragment 


Number of 
colonies 


% of colonies with active 
Lipolase 


Blue 428 + 429 


0 


0 


Blue 442 + 427 


0 


0 


Blue 424 + 425 


0 


0 


Blue 426 + 443 


0 


0 


Plasmid pJSO 
37 

Re comb i n a i-i r>n fc^B 


544 


100% 



The result with rad52 showed that recombination was completely 
abolished- The RAD52 function is required for classical 
recombination (but not for unequal sister-strand mitotic 
recombination) showing that the recombination of opened vector and 
fragment could involve a classical recombination mechanism. 



Excimple 6 

Recombination of m ultiple partial overlapping fragments 

In order to increase the mixing of the mutations by the 
recombination method of the invention, recombination of two 
rragments and one gapped vector were attempted. 



Table 15 


Vector + Fragment 


Number of 
colonies 


% of colonies with lipase 
■ activity 


1. pJS037/HindIII-XhoI 
+ PCR319+PCR327 


> 2000 


100% 


2. pJS037/HindIII-XhoI 
+ PCR321+PCR331 


« 2000 


s 0.2% 
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3. pJS037/HindIII-XhoI 
+ PCR319+PCR33 1 

4. pJS037/HincIIII-XhoI 
+ PCR319+PCR38 6 

5. pJS037/HindIII-xhoI 
+ PCR321+PCR386 



9. Blue 428/HinclIII- 
Xhol -I- PCR327+PCR38 5 

10. Blue 429/HindIIl- 
Xhol + PCR319+P CR3 8 6 

vl* ^^"^^ 429/HindIII- 
Xhol + PCR32H-PCR 386 

"tl" ^^^^ 442/HindIIl- 

AftOl D^o-sift..*— 



« 1500 



> 5000 



> 5000 



* 1500 



400 



« 350 



* 1500 



« 1% 



> 90% 



« 25% 




* 10% 



« 10% 



* 15% 



« 15% 



* 10% 



— •Il.A«uJ.xJ.- 
__£CR319+PCR327 

13. Blue 428/HindIII- 
Xhol -H 

14- Blue 42 9 /Hindi! I- 
Xhol + 

15. Blue 442/HindlIl- 
Xhol + 

16. Blue 428/HindIIl- 
Xhol + PCR331 

17. Blue 428/HindIII- 
Xhol -f P CR321 , 

As can be seen in Table IS i-ho ..^ 

lipase gen. u very emLlt Th , °' I'nu^inosa 

that the opened vectralon" " """" " '"""^ 

- gap .ee ^^Z^ 

L1on":r 100. o. active 



The seconci row is with 

The fra,:,ent PCR331 fra JL"aTthe ' ^^aMeshift. 

.,nx Site „Hich. .n thutr.:::, 'r^"" - - 



1 c r»/^^ 
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type fragment (see figure 3) and therefore gives about 0% of active 
lipase. The same is the case for row 3 and 6. 

In the row 4, fragment PCR386 containing a frameshift at the SphI 
site which is overlapped by wild type sequences in the gapped 
vector. The frameshift was recombined into less than 10% of the 
genes which is lower than the result for one fragment recombination 
in the last row of Table lA above. 

In row 5 a rather high mixing is observed between the 2 fragments 
each containing a frameshift and the wild type gapped vector giving 
25% active and 75% inactive lipase colonies. This is probably due 
to that the fragment PCR321 has the frameshift in the overlap 
between the 2 fragments and in the gapped region of the vector. If 

fragment PCR386 contributes to 10% inactives like in row 4, 
fragment PCR321 gives the remaining 65% inactives - therefore 
PCR386 gives 35% wt in the overlap. 

Row 7 is the "mirror image" of row 4 with the frameshift at the 
SphI site on the vector (see Figure 7) and 2 wild -type fragments 
giving an integration of the wild type fragment into more than 90% 
of the vectors. 

Row 8 shows like in row 5 that the frameshift of PCR321 in the 
overlap and gap region gives a very high number of inactive. 

In row 9, fragment PCR385 with a frameshift in the vector overlap, 
causes a very high number of inactives. 

Row 10 gives a rather high number of inactives compared to row 7 
and 4. It is not increased in row 11. 

Row 12 shows that two frameshifts on the vector gives a lower 
number of actives compared to one in row 7. 

The recombination of 3 partial overlapping fragments into a gapped 
vector is also very efficient as seen in Table 16, The last row 
with the vector alone gives very few colonies. As can be seen in 
figure 4 all fragments used are wt. In the first row in table 16, 
there are rather long overlaps between the vector and fragments, 
but in the middle row the overlap between PCR353 and 355 is only 10 
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bp long and it is still very efficiently recombined- Th • 
surprxsxng result .ay be utilized for very e.Ty Zl' 
even distantly related genes. For examole can 3 TfT ''""''"^ °^ 
from 10 different genes be made as PCR ^^"^^^^t domains 

a 10 to 20 bp overlap by primer sign n ^ "f^^^^ 
subsequently screened for the best clLT ^^^ether and 

combinations). combination (lOOO possible 



10 



Vector + Fragment 



pJS037/Pvuii-SDei 



Table 16 

Number of 
colonies 



^ of colonies with active 
Lipola se 

100% 



<= wj^Bj. 20 ~ — 



100% 



15 



20 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: Novo Nordisk A/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 
10 (E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2880 

(G) TELEPHONE: +45 4444 8888 

(H) TELEFAX: +45 4449 3256 

(ii) TITLE OF INVENTION: Method for preparing polypeptide variants 
15 (iii) NUMBER OF SEQUENCES: 15 

(iv) (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

20 (D) SOFTWARE: Patentin Release #1.0, Version #1.30B (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Primer 2843" , 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ACAAACATTA CGTGCACGGG 20 

35 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
4 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Primer 4 699" 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGGTACCCGG GGATCCAC 18 
(2) INFORMATION FOR SEQ ID NO: 3: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer 5164" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

60 AATTACATCA TGCGGCCC 18 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
- .(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer Riaix 
Ui) SEQUENCE DESCRIPTION: IeQ ID Not^lf 

CATTTGCTCC GGCTGCAGGG A 
10 21 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS- 
15 (A) LENGTH: 60 base oairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS; sinqle 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 

20 Ui, sKSilON^IlS ^D-SoTlf 

l^ll^rTclt S^.^ -CAGGAATT CTGCAAAAGC 
25 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 21 base oairs 
30 |B) TYPE: nucleic acid 

^ C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(11) MOLECULE TYPE: Other nucleic acid 



60 



35 



GGTCTGTACG GTCAGGAATT C 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS- 
Jl JENGTH: 19 base pairs 
B) TYPE: nucleic acid 
C STRANDEDNESS: single 
(D) TOPOLOGY: linear 

othefnucleic acid 
(A) DESCRIPTION: /desc = "Prtmo^ Kanon 
(XI) SEQUENCE DESCRIPTION: lis ID N^t ?f 

CGTTTCGGGT GACGGGGAC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS- 
55 ^ JfNGTH: 18 base pairs 

(B TYPE: nucleic acid 
C STRANDEDNESS: single 

GGAGCAAATG TCATTTAT 

(2) INFORMATION FOR SEQ ID NO: 9: 



21 



40 



45 



50 



19 



18 



wo 97/07205 



43 



PCT/DK96/00343 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Primer 4545" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCATTGGCAA CTGTTGCCGG AGCAGACCTG CGTGGAAATG 
GGTATGATAT CGACGTGTTT TCAT 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Vector pJS026" 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Humicola lanuginosa 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:!. .876 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 




384 



336 



288 



240 



192 



144 



96 



48 
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44 



40 



45 



50 



55 



60 



Ifs §f J ?g I ?S g - - OCC 

' i ^1 ^ - 1^ .J 

- 52 s I m SI IS - ™ - :: 

175 

- - i - =S i ?^ 51? - - 

190 

S5 s| s 5if |s - - s ?s - 
i S5 IS ?s s[ S5 ?irs S - - 

i S - ?| - - - J. ^ 

30 ?S ?Ii ^ S 52 S 5If - ^ 

255 

3, "J 5S ^1 II - - JJ^ ^ .0 .0 .C ^ 

270 

AAC ATT CCG GAT ATC PPt nno 

- - - ?IS - - «C J. „c „c ™ 



20 



25 



^^^^ 

" - - ^„ 



432' 



480 



528 



624 
672 
720 
768 



864 



285 

ACA TGT CTT TAG 
Thr Cys Leu * 

290 876 
(2) INFORMATION TOR SEQ ID NO: 11 • 

Bj TYPE: aroxno acid 
(D) TOPOLOGY: linear 

-a se. .0 

30 
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Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asd Ser 
65 70 75 80 

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn T^r Asn Lvs 
85 90 95 

Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
100 105 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cvs Ser Glv 
115 120 125 ' 

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asd 
1^0 135 140 

Thr Leu Arg Gin Lys Val Glu Asp Ala Val Ar| Glu His Pro Asp Tyr 

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu A^a Thr Val 
165 170 175 

Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
180, . 185 190 

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
195 200 205 

Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
210 215 220 

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
"5 230 235 240 

Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asd 
245 250 " 255 

He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
260 265 270 

Asn He Pro Asp He Pro Ala His Leu Tro Tyr Phe Gly Leu He Gly 
275 280 285 

Thr Cys Leu * 
290 

» 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Vector pJS037- 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Humicola lanuginosa 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
^B) LOCATION:!. ,876 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



wo 97/07205 



i1 
46 



PCT/DK96/00343 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



^ m ^ s - - m III s - ™ 

s ^ z q - „ 

?| IS 12 - - jj: 

SI S ?S J| - - .0 .0 ^„ ecc 

60 

i ?2 s[is ?s ir, ss IS - - - 

80 

0^=? f g - S| - .C „C ^ 

m ?2 12 - 1| - - J. 

- - S5 Z Z ?p S m . 
g SI |f= ?S IS I- IS -gc 

i - .^js SI |c ::: .c 

CGC GTG GTG TTT ACr rrr i-^h, 

Ar, V.1 v.a III ?S l?J SI §S SI IfJ f * f ■ 

165 i7n ^ Thr Val 

175 

S S 1^ i!J ^1 1 - - «c »c 
- il S ?2 IS ^ SI SI III S- 15 1^? - JS 
i ?S S= |?J ?s S IJ? SI ?II ?S |f 5 ^l - - 

i I" s„= s St iij i| S sji ists ?s 

0^ ?S IS ?S S II sS 5If ?S St S 

51? ?| m - - cjc - 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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AAC ATT CCG GAT ATC CCT GCG CAC CTA TGG TAC TTC GGG TTA ATT GGG Sfi4 
Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Glv 
275 280 285 



ACA TGT CTT TAG 
Thr Cys Leu * 
290 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
1 5 10 15 

Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
20 25 30 

Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
35 40 45 

Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
50 55 60 

Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu' Asp Ser 
^5 70 75 80 

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Aso Asn Thr Asn Lvs 
85 90 ' 95 ' 

Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
100 105 110:.. 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser. Glv 
115 120 125 

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
130 135 '140 

Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
145 150 155 '160 

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
165 170 175 

Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
180 185 190 

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
195 200 205 

^^-JJ '^^^ '^^^ Tyr Arg He Thr His Thr Asn Asp He 

210 215 220 

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
225 230 235 240 

Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
245 250 . 255 



876 
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Ue val ,ys ue =lu Oly He Asp Thr Oly 31y Asn 01„ p„ 

'^"^ 270 
Asn lU ..p xie Pro Ma Us .eu Trp Tyr P.e .eu lu cXy 

Thr Cys Leu * 
290 



20 



(2) INFORMATION FOR SEQ ID NO: 14: 

15 CHARACTERISTICS: 

(A) LENGTH: 864 base cairs 

;B) TYPE: nucleic fli§"" 

C STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE- 

(3) STRAIN: Pseudomonas sp. 
2^ (ix) FEATURE: 

(A) NAME/KEY: mat peptide 

(B) LOCATION:!.. 864 

(ix) FEATURE: 
^ (A) NAME/KEY: CDS 

(B) LOCATION:!, .864 

35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO- 14- 

^ m 12 S? ^ - - - go J. «c 

" S5 m - - - .c 

« - ?n S - - J. ^cc 

so - 1 S m IS - - e« 

60 

f 5,^ S SIf 12 ?If tS =f MC CCC OTC MC 

55 ®^ 70 I'ys Pro Lys Val Asn 

CTG TTC GGC CAC ^ ^ 

I'eu Phe Gly His Je'r §?J JJf CGC TAC GTT GCC GCC 

85 ^ lie Arg Tyr Val Ala Ala 

60 GTG CGC CCG GAT err rrr nr-r^ « 

V. v^f s I- ?s - ?n if= g - - 

« f I s z ^2 J- - z tr. ft f = - ^ 

115 *- ^1" Val Pro Glu Gly Ser 



48 
96 
144 
192 
240 
288 
336 
384 
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GCC AGC GAA GCG ATT CTG GCC GGG ATC GTC AAT GGT CTG GGT GCG CTG 
Ala Ser Glu Ala He Leu Ala Gly He Val Asn Gly Leu Gly Ala Leu 
130 135 

ATC AAC TTC CTT TCC GGC AGC AGT TCG GAC ACC CCA CAG AAC TCG CTG 4 80 

lie Asn Phe Leu Ser Gly Ser Ser Ser Asp Thr Pro Gin Asn Ser Leu 
145 150 155 160 

GGC ACG CTG GAG TCA CTG AAC TCC GAA GGC GCC GCA CGG TTT AAC GCC c;,p 
Gly Thr Leu Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala 
165 170 175 

CGC TTC CCC CAG GGG GTA CCA ACC AGC GCC TGC GGC GAG GGC GAT TAG S7fi 
Arg Phe Pro Gin Gly Val Pro Thr Ser Ala Cys Gly Glu Gly Asp Tyr 
180 185 190 ^ 

vl? vl? fJ? "^^"^ "^^^ GGC ACC AGC CCG CTG 624 

Val Val Asn Gly Val Arg Tyr Tyr Ser Trp Arg Gly Thr Ser Pro Leu 
195 200 205 

?hr vlt GAC CCC TCC GAC CTG CTG CTC GGC GCC ACC TCC CTG 672 

Thr Asn Val Leu Asp Pro Ser Asp Leu Leu Leu Gly Ala Thr Ser Leu 
210 ,• 215 220 

ACC TTC GGT TTC GAG GCC AAC GAT GGT CTG GTC GGA CGC TGC AGC TCC 720 
Thr Phe Gly Phe Glu Ala Asn Asp Gly Leu Val Gly Arg Cys Ser Ser 
225 230 235 240 

CGG CTG GGT ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC 768 
Arg Leu Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp 
245 250 255 

GAG GTG AAC CAG ACC TTC GGG CTG ACC AGC ATC TTC GAG ACC AGC CCG 816 
Glu Val Asn Gin Thr Phe Gly Leu Thr Ser He Phe Glu Thr Ser Pro 
260 265 270 

GTA TCG GTC TAT CGC CAG CAA GCC AAT CGC CTG AAG AAC GCC GGG CTC 864 
Val Ser Val Tyr Arg Gin Gin Ala Asn Arg Leu Lys Asn Ala Gly Leu 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Phe 
1 


Gly Ser 


Ser 


Asn 
5 


Tyr 


Thr 


Lys 


Thr 


Gin 
10 


Tyr 


Pro He 


Val 


Leu 
15 


Thr 


His 


Gly Met 


Leu 
20 


Gly 


Phe 


Asp 


Ser 


Leu 
25 


Leu 


Gly 


Val Asp 


Tyr 
30 


Trp Tyr 


Gly 


He Pro 
35 


Ser 


Ala 


Leu 


Arg 


Lys 
40 


Asp 


Gly 


Ala 


Thr Val 
45 


Tyr 


Val 


Thr 


Glu 


Val Ser 
50 


Gin 


Leu 


Asp 


Thr 
55 


Ser 


Glu 


Ala 


Arg 


Gly Glu Gin 
60 


Leu 


Leu 


Thr 
65 


Gin Val 


Glu 


Glu 


He 
70 


Val 


Ala 


He 


Ser 


Gly 
75 


Lys Pro 


Lys 


Val 


Asn 
80 


Leu 


Phe Gly 


His 


Ser 
85 


His 


Gly 


Gly 


Pro 


Thr 
90 


He 


Arg Tyr 


Val 


Ala 
95 


Ala 
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vax AT, Pro MP uu val Ala Ser Val Thr 5.r He Oly Ma ai. 
- 110 
Lys Cly se. AI. Thr Ma Asp P.e XI. A., Cl„ Val 

Ma s„ oxu Ma .eu Ma xle Val Aa„ cjy 

ri. A3„ PHe U„ se. =ly ser Ser Ser Asp ..r Pro c.„ Asn Ser .e„ 

Oly ™r ser .eu A.„ Ser Clu Cly Me Me Ar, P.e A.„ 2 

175 

Phe pro an c.y Val Pro Ihr Ser Ma Cys Cly „u Cly Asp Tyr 
vex vax Asn «y VaX Ar, Tyr Xyr Ser Trp Ar, CXy TJr s!r Pro .e„ 
Thr as; VeX p„ ^^^^^ 

Thr P.. Oly P.e =x„ Me As„ Asp Oly .eu VeX Cly Ar, Cys Ser Ser 
.eu CXy Met VaX XXe Ar, Asp Asn Tyr Ar, Met Asn „.a .eu 21 

Slu vex As„ CX„ ..r PHe SXy .eu T.r Ser XXe P., sxu Thr s^r Pro 

270' 

vex ser Vex Tyr Ar, .Xn .X„ Ma Asn Ar, .e„ .ys Asn Ma .Xy .e„ 

2o5 
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PATENT CLAIMS 

!• A method for preparing polypeptide variants by shuffling 
different nucleotide sequences of homologous DNA sequences by in 
5 vivo recombination comprising the steps of 

a) forming at least one circular plasmid comprising a DNA sequence 
encoding a polypeptide, 

b) opening said circular plasmid(s) within the DNA sequence (s) 
10 encoding the polypeptide (s) , 

c) preparing at least one DNA fragment comprising a DNA sequence 
homologous to at least a part of the polypeptide coding region on at 
least one of the circular plasmid(s), d) introducing at least one 
of said opene.d plasmid (s), together with at least one of said 

15 homologous DNA fragment (s) covering full-length DNA sequences 
encoding said polypeptide (s) or parts thereof, into a recombination 
host cell, 

e) cultivating said recombination host cell, and 

f) screening for positive polypeptide variants. 

20 

2. The method according to claim 1, wherein more than one cycle of 
step a) to f) are performed. 

3. The method according to claims 1 and 2, wherein two or more 
opened plasmids are shuffled with one or more homologous DNA 
fragments in the same shuffling cycle. 

4. The method according to any of claims 1 to 3, wherein the opened 
plasmid (s) is (are) gapped. 

5. The method according to any of claims 1 to 4 wherein the ratio 
between the opened plasmid (s) and homologous DNA fragment (s) are in 
the range from 20:1 to 1:50, preferable from 2:1 to 1:10 (mol 
vector :mol fragments) with the specific concentrations being from 1 
pM to 10 M of the DNA. 

6. The method according to any claims 1 to 5, wherein 2 or more, 
preferably from 2 to 6, especially 2 to 4 of the DNA fragments have 
partially overlapping regions. 



30 



35 



40 



30 



35 
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7. The method according to claim 6, wherein th*. n., , 

Of the DNA fragments lies in the range frol s to 117"'''' 

from 10 bp to 500 bp, especially 10 bp to Too bp '"'^^^'^^ 

5 8. The method according to any of claims 1 and 8 wh • 

one cycle of step a) to f) is bac.crossin with ;he ''''' 
DNA fragments. ^ ^ initially used 

9. The method according to any of clain,« -i ^ 

sequence (s) encoding the polypeptide (s) . °' ™* 

10. The method according to any of cl.in,. •, . 
Plasndd(s) is (are) opened clos/^ 

5 encoding the poly^ept^Lu. ' '"^ =e,ue„ce,3, 

, ™ie .o. higi .ediirioi^:::::::.™ 

12. The method according to anv ^i • 

polypeptides producible fro^fH ' ^° "^^^^^ t:he 

t^iwuuciDxe from the input DNA semi^nr^^^ 

proteins with biological activity. ^^-J^^hces are enzyines or 

13. The method according to claim 12, wherein th» , 

enzymes .elected from the group incTudino ""^^^P"''" "e 

cutinases, cellulases, amylases, peroxrist oxiT'^""' 

P=toxiaases, oxidases and phytases. 

14. The method according to claim 12, wherein th. , 

proteins with biological activity selected from th «^ 
insulin, ACTH, glucagon, soma^ostatir 

parathyroid hormone, pii.enttrv ho " thymosin, 

poietin, luteinizing /^'"'"^ hormones, somatomedin, erythro- 

"leasing f"' a tTZettT^ 

hormone, relaxin, interf^ton thr^* ' "^""^"ing 

terfaron, thrombopoietin (TPO) and prolactin. 

15. The method according to any of claims i to 13 . • 
one of the initially used input 0«A se^^ nces L TT. " 
sequence, such as a D»A se<^ence coding^oT" " / " '"'^^ ™* 
particular lipases, derived fr„™ f . , "ild-type enzymes, in 

SP-. in oartic,,,,. ^">9i. such as «,^icola 
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lanuginosa DSM 4109. 



16. The method according to claim .15, wherein at least one of the 
input DNA sequences is selected from the group of vectors (a) to (f) 
and/or DNA fragments (g) to (aa) coding for Humxcola lanuginosa 
lipase variants. 

17. The method according to any of claims 1 to 13, wherein at least 
one of the initially used input DNA sequences is a wild-type DNA 
sequence, such as a DNA sequence coding for wild-type enzymes, in 
particular lipases, derived from filamentous fungi of the genera 
Absidia, Rhizopus, Emericella, Aspergillus, Penicillium, 
Eupenicillium, Paecilomyces, Talaromyces, Thermoascus and 
Scierocieista. 

18. The method according to any of claims 1 and 13, wherein at least 
one of the initially used input DNA sequences is a wild-type DNA 
sequence, such as a DNA sequence coding for wild-type enzymes, in 
particular lipases, derived from bacteria, such as Pseudomonas sp., 
in particular Ps. fragi, Ps. stutzeri, Ps. cepacia^ Ps. fluorescens, 
Ps, plantarii, Ps. gladioli, Ps. alcaligenes, Ps. pseudoalcaligenes, 
Ps. mendocina, Ps. auroginosa, Ps. glumaer Ps. syringae, Ps. 
wisconsinensis, or a strain of Bacillus sp., in particular B. 
subtilis, B. stearothermophilus or or B. pumilus, or or a strain of 
Streptomyces sp., in particular S. scabies, or a strain of 
Chromobacterium sp. in particular C. viscosum. 

19. The method according to any of claims 1 to 13, wherein at least 
one of the initially used input DNA sequences is a variant DNA 
sequence, such as a DNA sequence coding for a variant enzyme, in 
particular lipase variants, derived from yeasts, such as Candida 
sp., in particular Candida rugosa, or Geotrichum sp. , in particular 
Geotrichum candidum. 

20: The method according to any of claims 1 to 19, wherein the 
homologous input DNA sequences are at least 60%, preferably at least 
70%, better more than 80%, especially more than 90%, and even up to 
100% homologous. 

21. The method according to any of claims 1 to 20, wherein the 
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recombination host cell is a eukaryotic cell su.h . 

or a plant cell. ' ^""^^ " ^ ^^^al cell 

22. The method according to claim 21, wherein said f , 
5 yeast . cell from the group of cell o' . . ^ 
particular strains of sj.....,:::\-^^^^^^ - 

or .e^os^r, sp., „ _ I paticuxt "/ 

Trichoderma sp.. t^^i^^ticuiar F. oxysporim, or 

15 23. The method according to any of clai™. i . o 

Plasmid ONA sequence (s, coding for the i "''^^''^ 
operably linked to a replication se^:j^^^ Polypeptide (s) isUre) 

24. The method according to claim 23, wherein , 

20 sequence (s) encoding the polypeptide (s) il, ""^""^^ 

functional promoter sequence ^^'^^-^ ^° a 

25. The method according to claim 24, wherein th. \ • 
expression plasmid wnerean the plasmid is an 

!5 

26. The method according to claim 25, wherein th 

plasmid. is PJS026 or pJS037 wherein the expression 

Title: Method for prepari;g polypeptide variants 
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54 47 f^CCCTTGTGCTGTTCTTTGTCTCTGCGTGGACGGCCTTGGCCAGTCCTATT 
1 MRSSLVLFFVSA ~w"~T~"A~'L"~A~'r~P~"r 

5507 2fIJf^f^f^^5^CGCAGGATCTGTTTAACCAGTTCAATCTCTTTGCACAGTATTCTGCA 

5567 

41 AAYCGKN NDAP ^a'T't'T'i'TT't'T ^-^^ 

5627 ^!Jf5If^555^ffTA^^^GGCGGATGCAACGTTTCTCTACTCGTTTGAAGACTCT 

61 N A C P E v'TTTTTTT'I'TTTT'd'T ^-^^ 

5687 ^f^f!fff5f^If.Jf^55ff53TCCTTGCTCTCGACAACACGAACAAATTGATCGTCCTC 

81 6 V G""D"TTT"r"L""A"Z"''D""N""rT"K"T"rT"L" ^-^^ 

5747 f 5TCTCGTTCCATAGAGAACTGGATCGGGAATCTTAACTTCGACTTGAAA 

101 s *F . R "G""r"R""s""r"r"N""w"''r"G""N""r"N""r"r"Z"K" 

5807 f^!^!f^5^J][^*^^TCCGGCTGCAGGGGACATGACGGCTTCACTTCGTCCTGGAGG 

121 E I N D 1 C S r~C~~R"~G''*'H'~D~"G'~r"T~s"r~w'~R" ^l^^ 

5867 !5!S!^fS5f^!^^f^^^'^*^'^^°^^^^°°*^^TGCT6TGAGGGAGCATCCCGACTAT 

141 s V A D T L r'TTVI'TTTTTTT'd'T 

5927 S°55!ff^fn5^CCGGACATAGCTTGGGTGGTGCATTGGCAACTGTTGCCGGAGCAGAC 

161 R V V f-TTT's^TTTTTTTTTTT'd" ^-^^ 

5987 5!fS!f2^^°+^^^^^^''A'^C^ACGTGTTTTCATATGGCGCCCCCCGAGTCGGAAAC 

181 L rTTTTTTTTTTTTTTTVTT 

604 7 f J^Iin^S^^TTCCTGACCGTACAGACCGGCGGAACACTCTACCGCA^ 

201 R \ rT"E""r"r"rT"Q""T"*G""G"TT"rTT"r"H" 

6107 ^f2^!f^!^^3fJ5253AGACTCCCGCCGCGCGAATTCGGTTAaVGCCATTCTAGCC^ 

221 T \ D i'~v''7~r'V'v'VT*r'VT~T~r'7~T"s"v ^-^^ 

6167 2^fIffI^^^IS^f5^^^^CCCTTGTCCCCGTCACCCGAAACGATATCGTGAAGATA 

241 E Y w i'~K~"i"T"r"L~v""rTT~R'T"D~rTT"r 

6227 f^Jff^^f^^^f JfA^"^<^<3CGGCW^TAACCAGCCTAACATTCCGGATATCCCTGC^ 
261 E G I D T"r"G'"r'N"N""Q~T'N'""""* * ^^^^ 



I P D I P A H 



^ o n ^'r^TGGTACTTCGGGTTAATTGGGACATGTCTTTAG 
6287 + H _ — ^. ^ 6322 

281 L W Y F G L I G T C L * 



Fig. 1 
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5447 -l^t^'l^.^^^r.t^^^^^^ 

" sL? ^ ^ ^ V L r r v-s-r-w-TTTT-s"^-r-x- 



5506 



5507 ~!Jf?f???!f!J?^^^^CTfTTAACCAGTTCMTC^^^ 

21 R R E V s Q D L F N'TVTTTTTTrT 

5567 

5627 ~!S^?ff5f2^f?^^A^f CSGATGCAACGTTTCTC^^^^^ 

^ A C P E V E K A d'TVT'Z'TTTT'd'T '''' 
5687 ff-JIfffff^IfJf^^^^^^^y^'^^CTCT^^^ 

81 G V C 0 V ^T^^c r L A ruTTrTTrTT 'I'' 
5747 ~!I!ff?!??5!2?°^^ATAGAGA^^^^^ 

S F R G S R S I E N^w'T'g^n'TTTTTT ''^^ 
5807 ?^^ff^!f^ff:!!ff!f^^°f5CAGGGGACAT^^^^^ 
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